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It is generally accepted that the asset price processes contain 
jumps. In fact, pure jump models have been widely used to model 
asset prices and/or stochastic volatilities. The question is: is there any 
statistical evidence from the high-frequency financial data to support 
using pure jump models alone? The purpose of this paper is to develop 
such a statistical test against the necessity of a diffusion component. 
The test is very simple to use and yet effective. Asymptotic properties 
of the proposed test statistic will be studied. Simulation studies and 
some real-life examples are included to illustrate our results. 

1. Introduction. It is now widely accepted that the asset price processes 
contain jumps. This is partially based on many empirical evidences, such 
as heavy tails in the asset returns; see Cont and Tankov (2004) and Carr 
et al. (2002) and references therein. In the meantime, many statistical tests 
have been established to detect jumps from discretely observed prices [e.g., 
Jiang and Oomen (2005), Barndorff-Neilsen and Shepard (2006), Lee and 
Mykland (2008), Ai't-Sahalia and Jacod (2010)], and these test results all 
seem to support the claim of the existence of jumps for the asset returns 
under their investigations. 

In recent years, pure jump models have been widely used as an alternative 
model for price process to the classical model, which has a continuous mar- 
tingale component; see Todorov and Tauchen (2010) and references within. 
The idea behind the pure-jump modeling is that small jumps can elimi- 
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nate the need for a continuous martingale. The class of pure-jump mod- 
els is extremely wide. It includes the normal inverse Gaussian [Rydberg 
(1997), Barndorff-Nielsen (1997, 1998)], the variance gamma [Madan, Carr 
and Chang (1998)], the CGMY model of Carr et al. (2002), the time-changed 
Levy models of Carr et al. (2003), the COGARCH model of Kliippelberg, 
Lindner and Mailer (2004) for the financial prices, as well as the non- 
Gaussian Ornstein-Uhlenbeck-based models of Barndorff-Nielsen and Shep- 
hard (2001) and the Levy-driven continuous-time moving average (CARMA) 
models of Brockwell (2001) for the stochastic volatility. Pure-jump mod- 
els have been extensively considered and used for general options pricing 
[Huang and Wu (2004), Broadie and Detemple (2004), Levendorskii (2004), 
Schoutens (2006), Ivanov (2007)], and for foreign exchange options pricing 
[Huang and Hung (2005), Daal and Madan (2005), Carr and Wu (2007)]. 
Other applications of pure-jump models include reliability theory [Drosen 
(1986)], insurance valuation [Ballotta (2005)] and financial equilibrium anal- 
ysis [Madan (2006)]. 

Given the wide usage of pure jump models, a natural question is: is there 
any statistical evidence from the high-frequency financial data to support 
using the purely discontinuous models alone without any continuous diffu- 
sion components? The question is of significance from both theoretical and 
practical viewpoints: 

• Many empirical evidences indicate that pure jump models can fit the data 
well; see, for example, Cont and Tankov (2004), and Carr et al. (2002) 
and references therein. Therefore, it would be of theoretical interest to 
establish some statistical tests for this purpose. 

• Given the existence of jumps, pure jump models are typically easier to 
handle than mixture models in practice, and a preferred choice to mixture 
models for users. However, before using a pure jump model, one must 
check its validity. 

• Various jump models have been well studied in the literature, as mentioned 
earlier. Should we decide to use pure jump models, we would have an array 
of available tools at our disposal. 

• Many results are strongly model dependent, and any model mis-specification 
could have a severe effect on the results. Therefore, it is imperative to 
choose the best possible model, and model selection is very critical. 

To put our question into a mathematical context, suppose that the price 
process Y is a jump diffusion process of the form 



for t G [0, T] with X% and Jt being the continuous and discontinuous (or 
jump) components, defined as 



(1.1) 



Y t = X t + J t 
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(1.2) t t 

Jt= / x(fi — v){ds, dx) + / / x /j,(ds , dx) , 

Jo J\x\<l JO J\x\>l 

where b and a are some deterministic functions such that X has unique 
weak solution, fx is the jump measure, with v its predictable compensator; 
for details on jump diffusion processes, see Jacod and Shiryaev (2003). Under 
this framework, the above question is tantamount to testing 

(1.3) Hq : I <r 2 (X s ) ds > 0, (i.e., diffusion effect is present), 

J o 

(1.4) Hi : / a 2 (X s )ds = 0, (i.e., diffusion effect is not present), 

Jo 

given the jump component Jt is present. Note that, under Hq, Yt is a mixture 
model of diffusion and jumps while, under Hi, it is a pure jump model. 

Cont and Mancini (2007) and Ai't-Sahalia and Jacod (2010) considered 
the above test using threshold power variation. They assumed a general 
continuous semi-martingale form of X as opposed to a diffusion form in 
the present paper. However, to perform their test, one needs to impose the 
condition that J is of finite variation [e.g., Theorem 2 of Ai't-Sahalia and 
Jacod (2010)]. This restriction rules out some interesting models used in 
finance, where the jumps are shown to be of infinite variation, as done in Ai't- 
Sahalia and Jacod (2009), Zhao and Wu (2009) and some other references 
mentioned earlier. 

In this paper, we propose a simple-to-use, general purpose and yet power- 
ful goodness-of-fit test for differentiating a pure jump model from a mixture 
model. The CLTs are also derived for the test statistics under Hq, regard- 
less whether the jump component is of finite or infinite variation. In that 
aspect, our proposed test works more generally than those proposed earlier 
by Cont and Mancini (2007) and Ai't-Sahalia and Jacod (2010). Even for 
the situations where tests by Cont and Mancini (2007) and Ai't-Sahalia and 
Jacod (2010) are applicable, our numerical results also show the superior 
performance of our proposed test. 

The paper is organized as follows. In Section 2, we give some motivations 
via a simple example and then formally introduce our test statistics. Asymp- 
totic results are derived in Section 3. Some review of alternative tests are 
given in Section 4. Numerical studies are given in Section 5. A real example 
is studied in Section 6. Some discussion on microstructure noise is given in 
Section 7. All technical proofs are postponed in the Appendix. 

Throughout the paper, the available data set is denoted as {it^O < i < n} 
in the fixed interval [0, T] , which is discretely sampled from Y. For simplicity, 
we assume that {Y ti ;0 <i<n} are equally spaced in [0,T], that is, t\ = iA n 
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Fig. 1. Smoothed histograms for the increment of the mixture model (- -), pure jump 
model (-), and diffusion term alone (-). From left to right, the sample sizes are 195, 780 
and 23,400, respectively. From top to bottom, a = 0.25 and 0.5, respectively. 



with A n = T/n for < i < n. Denote the jth one-step increment by 



A?Y = Y t 



i-i' 



1 < i < n. 



2. Test statistics. We start with a simple motivating example first and 
then introduce our test statistics for testing (1.3) and (1.4). 

2.1. A simple motivating example. We draw two respective samples {Y^; 
< i < n} from the following two models: 



H : Y t = aW t + S: 







[a mixture model), 



II 



1 • 



Y t = S[ 



[a pure jump model), 



where Wt and are a standard Brownian motion and a symmetric /in- 
stable Levy process, respectively. So the mixture model contains an extra 
continuous component o~Wt, in comparison with the pure jump model. For 
illustration, we take T = 1, f3 = 1.25 and a = 0.25, 0.5. 

The smoothed histograms (done by 10 6 replications) of the increments 
{ A"Y, 1 < j < n} under the two models are plotted in Figure 1 for sample 
sizes n = 195, 780, and 23,400, which corresponds to sampling every 2 min- 
utes, 30 seconds, and every second in a 6.5 hour trading day. From Figure 1, 
we can see some very clear patterns: 

(1) For small sample size n and small a, it is difficult to distinguish the mod- 
els under Hq and H± (the dashed line and dash-dotted line). However, as 
n and/or a increases, the difference is more significant under Hq and Hi. 
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Table 1 

Numbers of increments < aA„ for Y , W and , where a = 2, w= 1 and 
A n = 1/23,400. The numbers are averaged over 500 replications 



Parameter 


#{|A"Y| <aA~} 


#{|A?W| <aA~} 


#{|A?S| <<*A~} 


/3 = 1.50 


408 


488 


942 


= 1.00 


485 


489 


16,491 


13 = 0.50 


487 


487 


23,313 



(2) The differences between the normal histogram and the mixture one (the 
solid line and the dashed line) are small in all cases and become even 
more negligible as n increases. Literally, the jump component has been 
"absorbed" by the diffusion component in the center. 

(3) For fixed a, as the sample size n increases, the differences between mod- 
els under Hq and Hi are getting sharper. Take n = 23,400 and a = 0.5, 
for example. The histogram under H\ (dash-dotted line) shows a very 
narrow peak around the origin, while the histogram under Hq (the 
dashed line) stays rather flat. 

The example shows that there is a huge difference around the origin be- 
tween the models under Hq and Hi . If we use the number of "small" incre- 
ments as an indicator, U n = X^iil^f^l — u n} f° r some u n , then it relies 
heavily on whether the diffusion is present or not, particularly when the 
sample size n gets large. To give a better idea, some values of U n under the 
above two models are presented in Table 1 when n = 23,400. The drastic 
difference for the two models strongly suggests that we might be able to 
use U n to test whether the diffusion is present or not. 

2.2. Test statistics. Let us return to the testing problem given in (1.3) 
and (1.4). We observe from Section 2.1 that the increments from a pure jump 
model and a mixture model have fundamentally different behavior around 
the centers of their distributions. Namely, the distribution for the increments 
from a pure jump model shows a much higher peak in the center than that 
from a mixture model. In other words, the number of small increments from 
a pure jump model is far greater than than that from a mixture model. This 
suggests that we might use the number of small increments 

[T/A n ] 

U(A n )=:U(a,A n ,w,T)= £ I(\A? Y\ < aA%), 

i=l 

to define a test statistic. Note that U(A n ) simply counts the number of 
increments smaller than qA", where a > and w > 1/2. (Here, we suppress 
the dependence on a, w and T for convenience.) 
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Under some mild conditions (given in Section 3), the behaviors of U(A n ) 
are different under Hq and H\. Here is a heuristic argument. Under Hq, we 
have AfF » o-pT^JAJW, and hence 

[T/A„] 

EU(A n )* EP u ^(\^W\<aA^/a(X u ^)) 

i=l 

« 2a0(O)A^ 3 / 2 T / ^^(A^ds, 
J o 

where <^(x) is the density of the standard normal r.v., and Pt i ^ 1 is the 
probability conditioned at time ti—\. Consequently, we have U(A n ) is of 
order A n 3 ^ 2+ro under Hq. Similarly, we can show that U(A n ) is of order 

A -(l+l//3)+ ro under ^ Clearlyj we haye A "3/2+ ro <<; A -(l+l/«+w_ That 

there are far more small increments under the pure jump model (Hi) than 
those under the mixture model (Hq), which agrees well with the above mo- 
tivating example. Then, we will reject Hq (a mixture model) in favor of H\ 
(a pure jump model), if U(A n ) is large enough. 

From both Proposition 1 and (A. 24) in the Appendix, we see that the 
probability limit of U(A n ) depends on unknown population quantities, and 
hence can not be directly used for our testing purposes. To get around the 
problem, we adopt the same strategy as in Zhang, Mykland and Ai't-Sahalia 
(2005) and Ai't-Sahalia and Jacod (2010) by using a two-time scale test 
statistic, 

_ U(A n ) 

Vn -u(kA n y 

where U(kA n ) =: U(a, kA n , w, T) = Ztx^ T (\ A (i-i) k+ i Y + A l Y \ ^ 
a(kA n ) m ). As can be seen from (3.1) below, the distribution of V n is model- 
free under Hq, and we can reject Hq (a mixture model) in favor of Hi (a pure 
jump model) if V n > C for some critical value C > 0. 

We end the section by pointing out some differences between the above 
test and the one by Ai't-Sahalia and Jacod (2010). The test statistic given 
in (16) and (19) of Ai't-Sahalia and Jacod (2010) is based on the truncated 
pth. power variations while our test statistic given by U (A n ) and V n is simply 
based on the number of small increments. Further comparisons will be made 
later in the paper. 

3. Main results. We first list some assumptions and then present the 
main results. 

3.1. Model assumptions. Recall that Yt = Xt + Jt- Assume that Y is 
defined on a filtered probability space (O, T Y , T(), where Tj is the history 
of Y up to time t. 



PURE JUMP MODELING 7 

Assumption 1. Jt has a jump measure fi(dx,dt) with compensator 
v(u),dx,dt) = dtF t (uj,dx), such that, for all (co,t), we have F t = F{ + F", 
where: 



(1) F[ has the form 

F[{dx) = 1 + w\i+p X ' ) [ a(+)/ (° < s < e + ) + a (-) /(-£" < 



for some positive constants a^~\ 7, e + and e~ and some bounded 
function f(x), satisfying 1 + |x| 7 /(x) > 0, \ f(x)\ < L. 
(2) F" is a singular measure with respect to F[, satisfying f R {\xf A l)F['(u, 
dx) < L. 

Assumption 2. X and J are mutually independent. 

Assumption 3. &(•) is a bounded continuous functions, a(-) is bounded 
away from zero and infinity if it does not vanish and cr'(-) exists and is 
bounded. 

Assumption 1 implies that the small jumps of J form a Levy process with 
a ^-stable-like Levy density, while almost no condition is placed on the large 
jumps of J, and F" could even be random. Assumption 1 includes a rich 
class of models, like the variance gamma model, CGMY model, tempered 
stable process, etc. Assumptions 2 and 3 are technical conditions. 

3.2. Asymptotic results. Let Af(0, 1) denote a standard Gaussian random 
variable. We will use the stable convergence in law below, which is slightly 
stronger than weak convergence; see, for example, Jacod and Shiryaev (2003). 

Theorem 1. Suppose that w > (3— 1/2 and that Assumptions 1-3 hold. 

(1) We have 

f fe 3/2-W ) under Hq ^ 

\ / u - 1 +( 1 // 3_ro ) A0 ) under Hi and Assumption 4 below. 

(2) Letk = 2. Under Hq, we have 

At~ 3/2y2 (V n - k z / 2 ~™) — > aM{d, 1) stably, 
where M (0,1) is independent ofY and 

2 _ ( 1+A .3/2- ro) fc3- 2ro 



(3.1) V n 



a' 



2c^(0) ffa- 1 {X s )ds 



10 

To apply Theorem 1, one needs to estimate the unknown a 2 . However, in 
view of Proposition 1 in the Appendix and the stable convergence, we have 
the following. 
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Corollary 1. Assuming the same assumptions as in Theorem 1, we 
have 

A ^-3/2)/2 (K _ fc 3/2- W)/ - _^ Ar(Q; 1} under Hqj 

where 

(1 + ^3/2-^7^3-2^ 

CT A^ /2 - ro f/(A n ) 

From Corollary 1, at significance level 9, we can reject Hq if V^, > k 3 / 2 ~ m + 
z 1 _0A^, //4 ro ' 2 a and P(A/"(0, 1) > £1-0) = 0. It follows from Corollary 1 that 
the size of the above test is asymptotically 9. 

A slight variant of the test statistic V n can be given below. Let 

V 71. 



n U L (2A n y 
where U L (2A n ) = [U(2A n ) + l/'(2A n )]/2 and 
P7(2A„)]-1 

U'(2A n ) = J d A 2,+i y + A 2^l < a(2A n ) ro ), 

i=i 

P7(2A„)] 

f/(2A n )= ^ /(|A"F + A^_ i y|<a(2A n r). 
i=l 

In other words, we use linear combinations of U(2A n ) with different starting 
time points instead of a single U(2A n ) starting from time to when nonover- 
lapping two-step increments of Y are sampled. Similarly to Corollary 1, we 
can easily derive the following result. 

Corollary 2. Assuming the same assumptions as in Theorem 1, we 
have 

A (w-3/2)/2^ n _ 2 3 /*-™)/a — ^(0, 1) under Hq, 

where 

_ 2= U(A n ) + 2 3 / 2 -™U L (2A n )/2 
A^ /2 - ro f/ L (2A n )2 

Our final decision rule is: at significance level 9, we reject Hq if 
(3.2) V n > C, 

where C = 2 3 / 2 ~ ro + Zl - e A n /4 ~ w/2 a and P(Af(0, 1) > Zi-e) = 9. It follows 
from Corollary 2 that the size of the above test in (3.2) is asymptotically 9. 
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Remark 1. The requirement w > ft — 1/2 in Theorem 1 and Corollar- 
ies 1-2 can be easily satisfied by choosing w = 3/2 as ft £ (0,2). Moreover, 
whatever the value of w, Hq and H\ can be differentiated since 1 + 1//3 > 3/2 
for all fte (0,2). 

On the other hand, the behaviors of test statistics S n under Hq and H\ 
in Ai't-Sahalia and Jacod (2010) depend on the choice of p, that is, 2 > p > 
1 V /?; see Theorem 1 in that paper. Ai't-Sahalia and Jacod (2010). Since ft 
is unknown, it is difficult to choose p. To be on the safe side, one might 
try to choose ft close to 2. However, this will render the test with very low 
power since S n converges in probability to roughly the same limit 1 under Hq 
and H\. 

Remark 2. In Theorem 1 and Corollaries 1-2, we have ft G (0,2), and 
no further restriction on ft is imposed, so that the jump component could 
be of finite variation or infinite variation. By contrast, The CLT under Hq 
was developed by Ai't-Sahalia and Jacod (2010) only when ft < 1, namely 
when J is of finite variation. 

3.3. Asymptotic power. Before discussing the power of our test statistic, 
we list one more condition, which basically assumes that the drift term is 
zero when ft < 1. It is a standard assumption in the literature; see Jacod 
(2008) and Woerner (2003), and the references therein. 

Assumption 4. If ft < 1, we assume that £>(•) = 0, and J.^xF'^dx) = 
0. If ft = 1, we assume that £>(•) = and F'(dx) is symmetric about 0. 

The next theorem gives the asymptotic power of our proposed test (3.2). 

Theorem 2. Under Assumptions 1 and 4, with prescribed level 9 and 
for w > 1, we have 

P(V n >C\H 1 )^l, 
that is, the asymptotic power is 1. 

Remark 3. We end this section with some remarks on finite sample 
performance of our test statistics. Intuitively, the closer ft gets to 2, the 
more the pure jump process behaves like a diffusion process; thus, the more 
difficult it is to tell their difference apart, the less power our test will have. 
Similarly, the closer ft gets to 0, the more power our test will have. In fact, 
simple algebra yields C — 2 3 / 2-ro = O p (An +1 ^ ro )/ 2 ) 5 f r0 m which we can 
see that, as ft becomes closer to 0, the power of our test increases soon. This 
is further confirmed in our simulation studies given later. 

4. A review of other approaches. The testing problem considered in this 
paper has also been considered earlier by Cont and Mancini (2007) and Ait- 
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Sahalia and Jacod (2010). Since the work in both papers is similar, we will 
only review the test by Ai't-Sahalia and Jacod (2010) (hereafter AJ's test) 
below. 

The building block of the AJ's test is based on the truncated p-power 
variation, 

[i/An] 

(4.1) B(p,u n ,A n )= ]T \A?Y\*>I(\A?Y\<u n ), 



(4.2) S n 



i=l 

where p E (1,2), and u n satisfies u n j A P n — > 0, u n /An + — > oo, for some < 
p_ < p + < 1/2. Similarly to Zhang, Mykland and Ait-Sahalia (2005), Ait- 
Sahalia and Jacod (2010) defined a two-time scale estimator 

Sn = n^ P:Un, uT\ for an inte S er fc > 2 > 

and showed that 

^ P f k 1 "?/ 2 , under H , 

\ 1, under Hi, if 2 > p > 1 V /3 and p + < (p - 1) jp 

and that when f3 < 1, 

(4.3) (5„ - fc 1 "^ 2 )/^ AT(0, 1) under F , 

where u 2 = CB(2p, u n , A n )/B(p, u n , A n ) 2 for some constant C. Noting 
k l-p/2 > 1 

, one would reject Ho if S n < Co, for some Co determined from the 
CLT. Ai't-Sahalia and Jacod (2010) also showed that the asymptotic power 
of this test is 1. 

We make several remarks regarding the AJ's test: 

• From (4.2), the behaviors of test statistics S n under H$ and H\ depend 
on the choice of p, that is, 2 > p > 1 V f3. Since /3 is unknown, it is difficult 
to choose p. To be on the safe side, one might try to choose (3 close to 2. 
However, this will render the test with very low power since *S*^ converges 
in probability to roughly the same limit 1 under Hq and H\. 

• The CLT under Hq, (4.3), was established in Ai't-Sahalia and Jacod (2010) 
only for the case f3 < 1, namely when J is of finite variation. However, 
when P > 1, that is, when J is of infinite variation, no CLT is available, 
and hence the size of the test cannot be controlled for that case. This rules 
out some interesting applications when j3 > 1. 

• For (3 G (0, 1), where the CLT is available for AJ's test, we might expect 
that AJ's test should have very good power, particularly as /3 gets smaller 
toward 0. However, our simulation studies give some counterintuitive re- 
sults; see Table 6. 

5. Numerical studies. In this section, we conduct simulations to evaluate 
the performance of our proposed test statistics, and make some comparisons 
with that of Ait-Sahilia and Jocod (2010). 
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Table 2 

Sizes of the test (%) under different n 's and fi's, {8 = 2, k = 2) 



Value of f3 


1.2 


1.3 


1.4 


1.5 


1.6 


1.7 


1.8 


1.9 


n = 1560 


2.97 


4.18 


4.71 


4.08 


4.39 


3.99 


4.13 


4.27 


n = 2340 


3.94 


4.42 


4.41 


4.27 


4.55 


4.42 


4.25 


4.37 


n = 4680 


3.98 


4.36 


4.46 


4.82 


5.31 


4.81 


4.93 


4.56 


n= 11,700 


4.34 


4.50 


4.94 


5.06 


4.93 


5.06 


4.78 


4.29 



The test statistics V n and V n involve choosing the threshold level u n = 
aA^ 7 . In view of the requirement w > (3 — 1/2, a conservative choice of w 
would be 1.5. To compensate for the conservative choice of w, we choose 
a relatively large a by a n = 5(logn) K for some positive constants 5 and k. 
This choice will not affect any of the asymptotic results in the paper. 

Assume that the data generating process under the null and alternative 
hypotheses are, respectively, 

(5.1) H : Y t = X t + e'Sp, t , 

(5.2) H x : y t = exp(- 7 t) + 0.5S /3ii , 

where Xt is an Ornstein-Urlenbeck process. dXt = —Xt dt + dWt, and W is 
a standard Brownian motion, and Sp is a symmetric /3-stable process. Let 
T = 1, 6' = 0.5. Also we take n = 1560,2340,4680, 11,700 and 23,400, corre- 
sponding to an intra day data set recorded every 15, 10, 5, 2 and 1 seconds 
in a 6.5-hour trading day, respectively. We will simulate 10,000 samples from 
each model above. 

Asymptotic sizes. Fix the nominal level 9 = 5%, so the critical value is 
•Zo.95 — 1-645. The size of the test is calculated by the percentage of samples 
such that (3.2) holds true over 10,000 samples. 

Table 2 reports the asymptotic sizes for different sample sizes. From the 
table, we see that the type I error is well controlled by 5%; as the sample 
size n increases, the asymptotic sizes become closer to the true size 5%. 

Table 3 reports the asymptotic sizes across different threshold levels which 
reflect the number of effective data. It shows that control of type I error is 
not affected much by changes of 5. 

Asymptotic power. We also consider the power performance of V n . The 
power of the test is the percentage of samples with (3.2) violated over 10,000 
samples. The results are listed in Table 4 for different values of f3. 

From Table 4, it is clear that, as the sample size n increases, the test 
becomes more powerful overall, as expected. The test is powerful especially 
when /3 is away from 2. When f3 approaches 2, the power gradually di- 
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Table 3 

Sizes of the test (%) under different 8 's and P's (n = 23,400, « = 2) 





1.2 


1.3 


1.4 


1.5 


1 


1 7 
J. . i 




1 Q 


8=1.0 


4.64 


4.22 


4.21 


4.68 


4.61 


4.16 


3.98 


4.03 


8=1.5 


4.23 


4.16 


4.65 


4.54 


4.62 


4.97 


4.35 


4.65 


8 = 2.0 


3.94 


4.42 


4.41 


4.27 


4.55 


4.42 


4.25 


4.37 


8 = 2.5 


4.30 


4.62 


4.16 


4.38 


4.47 


4.49 


4.65 


4.08 








Table 4 












Powers of the test (%) under different 


n 's and ft 's 


(8 = 2, 


rc= 2; 




Value of /3 


1.2 


1.3 


1.4 


1.5 


1.6 


1.7 


1.8 


1.9 


n = 1560 


100 


100 


100 


94.10 


48.79 


19.86 


9.85 


5.38 


n = 2340 


100 


100 


100 


91.36 


46.46 


20.53 


10.43 


5.82 


n = 4680 


100 


100 


100 


89.55 


48.52 


21.85 


17.55 


6.14 


n = 11,700 


100 


100 


100 


92.78 


55.33 


26.54 


12.67 


7.13 


n = 23,400 


100 


100 


100 


96.01 


63.80 


31.05 


14.59 


7.34 



minishes. This is easily understandable as in this case the behavior of the 
discontinuous process resembles that of a Brownian motion. This can also 
be seen from (3.1). 

Finally, we examine the asymptotic sizes over different choices of 9'. We 
fix n = 2340, 5 = 2, k = 2 and 9 = 5%. In Figure 2, the asymptotic sizes for 
ft = 1.25 and 1.5 are plotted against 9' . Clearly, the asymptotic sizes are not 
sensitive to choices of 9' . 

Comparisons with AJ's test. Now we compare the performance of our 
estimator V n with that of AJ's estimator S n , under the same settings as 
in (5.1) and (5.2). However, since AJ's test is only shown to be valid for the 
case ft € (0, 1) (i.e., the jump process is of finite variation), our comparisons 
are also restricted to that case. Tables 5 and 6 report the sizes and powers 
of our test and AJ's test for various values of ft € (0, 1), respectively. 

0.06 
0.055 
0.05 
0.045 
0.04 

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 

Fig. 2. Sensitivity plot of asymptotic sizes to choices of 8' . 



beta^L 25 
beta l= 1.50 
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Table 5 

Size of our test v.s. that of AJ's test, (%), 6 = 2, k = 2 









0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


1.0 


n = 


1560 


v n 


4.19 


3.81 


4.18 


3.90 


4.08 


3.84 


3.62 


3.59 


4.06 


4.11 






AJ 


4.08 


4.23 


4.22 


3.94 


3.99 


4.29 


4.24 


4.10 


4.37 


4.41 


n = 


2340 


Vn 


4.31 


4.37 


3.97 


3.98 


4.16 


4.10 


4.18 


3.93 


4.11 


4.32 






AJ 


4.36 


4.33 


4.23 


3.89 


4.67 


4.47 


4.61 


4.29 


4.54 


4.47 


n = 


4680 


Vn 


4.07 


4.38 


4.01 


4.57 


3.87 


4.24 


4.34 


4.18 


4.52 


4.43 






AJ 


4.52 


4.53 


4.33 


4.64 


4.56 


4.43 


4.41 


4.71 


4.89 


4.76 



• For both tests, all sizes are close to to the nominal level, 5%, with AJ's 
test being slightly closer overall. 

• Our test outperforms AJ's in terms of power throughout. In fact, our test 
has full power for all (3 6 (0,1], even for sample size n = 1560. On the 
other hand, AJ's test has very low power in detecting the alternatives for 
(3 < 0.7, even when n = 23,400. 

The very low powers of AJ's test for small (3 came as a surprise to us. Some 
more detailed analysis suggests that the reason might be due to the large 
variation of S n for finite sample size n under H\. More precisely, from (69) 
in A'it-Sahalia and Jacod (2010), we have S n = Op(u n ) under H±. So for 
finite sample n, S n may not be close to for small /3, which often places 
the test statistic S n wrongly within the acceptance region, resulting in low 
power. It also explains why the problem is mostly pronounced if (3 is closer 
to 0. 

Figure 3 displays the histograms of the studentized S n as given in (4.3) 
when n = 4680. We see that values of studentized S n 's are seldom less than 
-20.05 = —1-645, except in the case (3 = 1. 



Table 6 



Power comparisons 


of our test v.s. that 


of AJ's 


test when (3 G (0, 1], (%), 8 = 2, 


K= 2 




P 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


1.0 


n = 1560 


V n 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 




AJ 

















0.01 


0.25 


2.03 


7.61 


24.03 


n = 2340 


Vn 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 




AJ 











0.01 





0.07 


0.43 


3.32 


14.82 


48.36 


n = 4680 


Vn 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 




AJ 

















0.08 


0.97 


8.66 


48.26 


97.02 


n = 23,400 


Vn 


100 


100 


100 


100 


100 


100 


100 


100 


100 


100 




AJ 

















0.30 


6.40 


85 


100 


100 
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-0.9 -0.8 -0.7 -0.6 -0.5 -0.4 -0.3 -0.2 -1.4 -1.2 -1 -0.8 -0.6 -0.4 -0.2 0.2 0.4 





-2 -1.5 -1 -0.5 0.5 -3.5 -3 -2.5 -2 -1.5 -1 -0.5 



Fig. 3. Upper left panel: f3 — 0.25; upper right panel: j3 = 0.50; lower left panel: f3 — 0.75; 
lower right panel: /3 = 1.00. 



Sensitivity to model misspecification of our test. In the model assump- 
tions, we assumed a local volatility function. Now we conduct a simulation 
study to check the sensitivity of our test to model misspecification; see Fig- 
ure 4. Instead of using an Ornstein-Urlenbeck process as the continuous part 
of the full model, we use a stochastic volatility process here, that is, dXt = 



at dWt with at 



,V2 



dv t = k(t] - v t ) dt + jv} /2 dB t , E[dW t dB t ] = pdt. We 



take i] = 1/16, 7 = 0.5, k = 5, p = —0.5. We use O'Spj as the jump process 
as in last two simulations. Now we fix n = 23,400, 6 = 1, k = 2, 8' = 0.25 and 
8 = 5%. All simulations are run 10,000 times. From Figure 4, the asymp- 
totic sizes are not much affected by using a stochastic volatility model as 
the continuous part. 



6. A real data set analysis. In this section, we implement our test to 
some real data sets. We use the stock price records of Microsoft (MFST) in 
there trading days, Nov. 1, Dec. 1 and Dec. 11 in the year 2000. All data 
sets are from the TAQ database. For prices recorded simultaneously, we use 
their averages. To weaken the possible effect from microstructure noise, we 
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0.03 1 1 1 1 1 1 1 1 1 1 

0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2 

Fig. 4. The data generating process is the combination of a stochastic volatility process 
and a standard symmetric stable process. 



sparsely sample observations every 10 seconds and the sample sizes for the 
aforementioned three trading days are 1343, 1701 and 1253, respectively. 
Finally, we take the logarithm of the sparsely sampled prices and use the 
log prices to calculate the test statistics. We set T = 1 (day) consisting of 
6.5 hours of trading time. 

We now discuss how to choose the parameters 5, k and w. As argued 
theoretically at the beginning of Section 5, we fix w = 1.5. Since k and 8 are 
dependent parameters, we fix k = 2 and consider a grid of points of 5 such 
that 

(6.1) 5(logn) K A-<a*Ay 2 , 

where a* is approximately the averaged standard deviation of the diffusion 
component of one 10-second log return in case the diffusion term exists in 
the underlying dynamics. Mathematically, a* is defined as 

a* 2 =: i£(A?X) 2 I(|A?X| < -4 P ± £ o*(X.)ds; 

see Jacod (2008) for example. In virtue of (6.1), we can choose 5 conserva- 
tively as the grid points from 1 to 8 with equal step length 0.1 for all three 
data sets. The plots are displayed in Figure 5. 

From the plots, the observed test statistics are all larger than 1.645. There- 
fore we can reject the existence of the diffusion component and simply use 
a pure jump model to characterize the underlying dynamics of the prices for 
those three days. 




123456781234567812345678 



Fig. 5. The statistics evaluated over different values of 5. From left to right: test statistics 
for 01, Nov., 01, Dec. and 11, Dec, respectively. The horizontal axis stands for the value 
of 5 while the vertical axis stands for the value of the test statistics. 
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7. Discussions on microstructure noise. It is widely accepted nowadays 
that microstructure noise is present. Various methods have been studied to 
handle the issue of the microstructure noise in the context of the integrated 
volatility estimation for high-frequency data. See, for example, Ai't-Sahalia, 
Mykland and Zhang (2005), Zhang, Mykland and Ait-Sahalia (2005), Zhang 
(2006), Fan and Wang (2007), Podolskij and Vetter (2009) and Jacod et al. 
(2009), among others. A very effective technique in handling microstructure 
noise is the so-called "pre- averaging method"; see Jacod et al. (2009) and 
Podolskij and Vetter (2009). 

Suppose that the observation at time ti is 

Z u =Y u +e u , i = l,...,n, 

where Yf is an unobserved semi-martingale of the form (1.1) and (1.2), 
and £t t with mean and variance a 2 is the microstructure noise at time ti. 
We wish to test (1.3) and (1.4), that is, whether Yj can be modeled as a pure 
jump process, or not. 

So far, we have not seen any work in the testing framework in the pres- 
ence of microstructure noise. We now apply the simplest pre-averaging tech- 
nique as follows. We first separate the full data set Z^, 1 < i < n into n/M 
nonoverlapped blocks, 

{Z tl Z tM }, {Z tkM+1 , . • • , Z( k+ i) M }. 

Then within each block, we take the average of all JC-step increments, that 
is, 

(k+l)M 

^ j = nlM-K + l ^ ^ ~ Zt ^'- = X i + 1 3 + g J ' 

' i=kM+K+l 

j = l,..., n/M. 

Simple calculation yields Xj = O p (M^ 1 / 2 ),J ) = O p (M~ 1 ^),e = 
O p ((M/n) 1 / 2 ). By properly tuning M, for example, M = o(n 1//2 ), one could 
make £j asymptotically negligible. Based on the modified data set Z%,..., Z^j, 
the test statistics can be defined (similarly to V n ) as 

tt _ U(A M ) 



u(kA M y 

where Am = T/M, U (Am) and U (fcA^) are defined as U (Am) and U (HAm) 
by replacing Yj with Zi and by replacing A n by Am, for example, U(Am) = 

y:f =1 /(i^i< a (A M n. 

Under appropriate conditions, the results obtained in the paper should 
be expected to hold here as well, for instance, 



(7 1) V 1 fcl ' 5 



p \ k , unuer hq, 

fc i+(i/^-w)A0 under Fi. 



300 
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Let us conduct a simple simulation study to investigate the feasibility of 
the test statistic V n . Take Y t = Wt + St under Hq and Y t = St under Hi, 
where Wt and St are a standard Brownian motion and a symmetric Cauchy 
process (i.e., /3 = 1), respectively. Also take a 1 ~ N(0,a 2 ) with a = 0.01. We 
let T = 1, n = 23,400 and k = 2. We further take M = 234, K = 50, a = 9, 
w = 1.5. Note that the choice of M = 234 corresponds to taking averages 
about every 4 minutes. The simulation is repeated 5000 times. Each time, 
we calculate V n both under Hq and H\ . Their histograms under Hq and Hi 
are plotted in Figure 6. 

From Figure 6, we see that the means of V n under Hq and H\ (marked 
by * in the horizontal axis) are 1.0578 and 1.4781, respectively. These are 
rather close to the asymptotic values 1 and 1.414, given by (7.1). Note that 
the effective sample size after pre-averaging is 23,400/234 = 100, a rather 
small sample size for this testing purpose. This explains partly why the 
variances of histograms plots are rather large, and there are substantial 
overlaps between the plots under Hq and H\. If we choose M = 120, or even 
60, then the histograms under Hq and H\ will become thinner and more 
easily separable. 

The above simple simulation study suggests that the pre-averaging method 
would work well in handling microstructure noise in the testing problems. 
Of course, there remain many theoretical and practical issues to be resolved. 
For example, we need to establish a CLT under Hq; to study its asymptotic 
power; to find a data-driven method to determine parameter M, etc. We 
will pursue these and other related issues in our future work. 



APPENDIX 

In the sequel, C will denote a constant which may take different values 
in different places, and x is an arbitrarily small positive number. Also, Pt i _ 1 
and E ti _ 1 denote probability and expectation given time respectively. 
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A.l. Proof of Theorem 1. Let a\ = 2a<£(0) J T cr -1 (X s ) ds. Now 
A ^-3/2)/ 2(K _ £.3/2-^ 

_ A^T 3/2)/2 (t7(A n ) - g 2 ) - fc( 3 / 2 — )/ 2 (fcA w )("~ 3 / 2 )/ 2 (L>(fcA w ) - a 2 ) 

k™~ 3 / 2 U(kA n ) 

A 

'~ B' 

By Proposition 1 (below), A — > b uqZ\ — <t Z2 with zi and z 2 in- 

dependent Gaussian random variables independent of J- Y , while B -^- p 
fcm-3/2 a 2^ w \ 1 [ c i l j s ra ndom but depending only on T Y . Then Theorem 1 
is proved. 

Now, we prove Proposition 1, in which we need the following two lemmas. 
Lemma 2 implies that the proportion of paths of a jump diffusion process 
having "small" increments is the same as that of the diffusion component. 
This has its own interest. 

Lemma 1. Let A» = {co : \A?X + x\ < aA™}. 

(1) For \x\ < A l J 2 , \P t ^ (A*) - 2 ^}TlZ 1 ^ CA n(x 2 A- 3/2 + A"*). 

(2) For any x G R/{0}, we have Pt^Ai) < CA^~ 1/2 . 

Proof. Define f(x) = J* a~ 1 (y)dy, then /'(x) = a~ 1 (x) and /"(x) = 
— a' (x) / a 2 (x) . So /(x) is strictly increasing. Let E t = f(X t ), or equivalently, 
X t = f~ 1 (B t ). By Ito's formula, 

Let T[ = Tt+U-x where t%-\ is the (i — l)th observation time defined at the 
end of the Introduction, and Wt = Wt+t^ > t > 0. It is easy to see that W is a 
martingale under (f2, J 7 , J 7 /, Pj^j ) with quadratic variation t. Thus by Levy's 
characterization theorem, Wt is a Brownian motion under (f2, J-, T[, Pt^)- 
By the Girsanov theorem, there exists a probability measure Qt i _ 1 , locally 
equivalent to Pt^ , satisfying 



(A.2) dQt ^ 



dP t , 



exp f - jT 6(X s+ti _ 1 ) dW s - \ J* b 2 (Xs+t^J ds^ , 



such that St + t i _ 1 , t > 0, is a Brownian motion under Qt i _ 1 - 
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Now 



(A.3) 



P u _, (A) = P U -i (-x - «A- + X ti _, < X ti <-x + aA™ + X U _J 
= P ti _ 1 {f{-x + X ti _ 1 -aA™) 

<f(X ti )<f(-x + X ti _ 1 +aAZ)) 

= Pu-i(U<Eu -Sti-i <Ui), 

where k = /(-x + X^ -aA^)-/(X t ,_ 1 ) and m = /(-i+^+oAJ) 
f(Xt i _ 1 ). Taking t = ti — in (A. 2), we then have 

*'i — 1 



(A.4) 



exp 



&o/- 1 (5 s )dW s 

ti-l 

1 /*** — 



+ \Su Q°f~ 1 W 3 J ds ) d Qt i - 1 . 



Since | exp(a?) — 1| < 2\x\ for \x\ < log 2, by boundedness of the diffusion 
coefficient, Levy's theorem of continuity modulus and change of time, 

\p u _Mi) - (*(«i/v^) - ^(/i/v 7 ^))! 

for any arbitrarily small x > 0. On the other hand, by the mean value the- 
orem, 

(A.5) *(uiA/50 " *&/>/S0 = <M£)^7^ = 2aAr 1/2 ^, 

where £ G -^=(7^,^) and 77 G (-1 + X t . j - aA%, -x + Xu^+aAg). Then 
as A n 0, by Assumption 3, we have 

(A.6) K^-at-x + AVJI^CA- 

Since as n large enough < C|x| and < C|x| which yields that £ G 
^W-C|x|,C|x|). Then, since <f>'(0) = and is bounded, 

(A.7) 0(O)| <C{x?A-\ 

The combination of (A.3)-(A.7) completes the proof. □ 

Lemma 2. Let B { = {lo : \AfY\ < aA™}. Then, 



2acj)(0) 



A07-1/2 
L — *n 



< c(A^- 1 /2+i-^/2 + a^-x). 
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Proof. We write 



(A. 



_ + 

|a;|<-v/A^ J\x\>y/A^ 



P ti _AMX + x\< aA%) dp*,., (A™ J < x) 



Since J is purely discontinuous, we can take the exponent in (64) of Ai't- 
Sahalia and Jacod (2009) as 1/2, and then by Lemma 1, 

(A.9) P i>2 < CA^~ 1/2 P ti _ 1 (|A™J| > V^) < C&Z-VW-M 2 . 



By Lemma 1, 
(A.10) P lyl = [ 

J\x 



2a^(0)A^ 



-1/2 



Similarly, we can obtain 



dP ti _ 1 (A?J<x) + R n ,,, 



\Rn,i\ < / C[(x) 2 Ar 3/2 + Ar x ] dP*.! (A? J < x) 

J\x\<</5^ 

(All) = CAr 372 ^^ (A"J) 2 /(|A"J| < V^) + * 

Since \x\ < \/A n , x + A^ _ 1 ) — a{Xt i _ 1 )\ < Cy/A n , by boundedness of 
the diffusion coefficient, 

_2a0(O)Ar 1/2 

(A.12) 



1 



1 



<CA- 

On the other hand, as in (A.9), we have 
2a0(O)Ar 1/2 



ai-x + Xt^) a{X u _,\ 



dP u ^J<x) 



(A.13) 



*(*t,-l) 

< mo-l/2+l-j3/2 
— ^ '- 1 n 

Combining (A.10), (A.12) and (A.13) gives 



/ ^(A^^^-l) 



Pi,l 



2a(j){0)An 



which together with (A.9) completes the proof. □ 

Define U{A n ) = A 3/2_ro [/(A n ), and so U(kA n ) = (£A n ) 3 / 2 " ro [/(AA n ) 
Then we have 
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Proposition 1. We have 

At~ 3/2)/2 (U(A n ) - alk^W*[U(kA n ) - 4}) 
— >■ co(zi, Z2), JF Y -stably, 
where z\ and Z2 are two independent Gaussian variables independent of J rY . 

Proof. Without loss of generality, assume k = 2. Denote Ii = J(| A™ Y\ < 
aA"). In view of Lemma 2, 



(A.14) 



PT/An] 

i=l 



<C7(A^/ 2 + AV 



Since x could be made arbitrarily small, and w > /3 — 1/2, or equivalently, 
l-/3/2>^^, 

[T/A„] 

(A.15) A( ro - 3 / 2 )/ 2 (L>(A n )-a 2 ) = A3/ 2 -W2 £ (/< -i^) + (i). 

i=l 

Now the summands in (A.15) are centered martingale difference sequences 
w.r.t. J~ti-.ii 1 <i < PVAn]- In view of Lemma 2, and making use of (A.14) 
again, 

[T/A n ] [T/A„] 
A 3/2- ro £ ^_ i (/ i -^_ i / i ) 2 =A3/ 2 — £ E ti _ 1 j i + 0P (1) 
i=l j=l 

= ^ + o P (l). 

Since the indicator function is bounded, the Linderberg condition for the mar- 
tingale central limit theorem holds automatically. Then by (A.14) and (A.15), 

(A.16) A^ 3 / 2 )/ 2 (C/(A n ) - a 2 ) -> a QZl 

.F^-stably if the following holds [c.f. Theorem IX 7.28 of Jacod and Shiryaev 
(2003)]: for any bounded martingale N € T 



Y 



[T/An] 

(A.17) A (3/ 2 —)/ 2 £ E t ._ 1 {K>N)I i ^ p Q. 

i=l 

Since T Y =JF X V J- J , it suffices to show (A.17) with iV replaced by X and 
iVi £ J 7,7 , respectively, where N± is a bounded martingale. By Levy's theorem 
of continuity modulus, (A.14) and w > 1/2, 

PVAn] 

A (3/2- ro) /2 £ ^_ i(A n X)/ . 
i=l 
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(A.18) 

< CA(f/ 2 — )/ 2 +V2-x £ ^ J. _>P o. 

i=l 

Next, by independence of X from J 7 ^ and Lemma 1, 

[T/A n ] [T/A„] 

(A.19) A^ 2 "^ 2 i? ti _ 1 (A™iV 1 )/ l <CA^ 2+1 /4 £ £7 t ._ i |Af iVi | . 

i=l i=l 

By Cauchy-Schwarz and Jensen's inequalities, the orthogonality of the mar- 
tingale increments, the expectation of the right-hand side in (A.19) is 

/[T/An 



/I'-i'/Anl \ 



T A / [T/A ' l] \ 
(A.20) < CA-/ 2 +V4^ £ ^(AfiV^J 

< CAt- 1/2)/2 yjE(N ltT -N li0 y. 
Since > 1/2, (A. 17) holds. Similarly, we can deduce that 
(A.21) {kA n )^~ ^l 2 {U{kA n ) - o"q) -> <r *2 

J-^-stably. Finally in view of (A. 16) and (A.21), and by virtue of Lemma 2 
and (A. 14), to complete the proof, it suffices to show that 



(A.22) 



[T/kA n ] / i+k-l 

A 3/2- ro J- E ti Al(\Al k Y\<a(kA n r)Yll(\ A ] Y \^ aA n 

i=l \ j=i 



where A™ k Y = YljLi 1 Y- To this end, we give an estimate of the sum- 
mands in (A.22). Let Af~ j Y = A" fc Y - A"Y, A^~ = E/=/ A F y and 
A Tk + = Szij+i A F^' f° r 2<j<i + A; — 1. We make the convention that 
A Tk~ = ^l'k +k ^ + = 0- Then there exists a constant C such that 
{|A^Y|<a(fcA n nn{|A?y|<aA-} 



3 

C {| A n ^Y\ < CA™} n {\A]Y\ < aA™}, 



and consequently, in view of k = 2, we have 

E ti _J{\Al 2 Y\ < a(2A n r)I(\A]Y\ < aA% 
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< E^Ml^tYl < CA%)E tj _J(\A]Y\ < aA%)] 

(A.23) 

+ E ti _ 1 [I(\A]Y\ < aA^)E tj I(\A^+Y\ < CA™)} 

< CA 2 ™- 1 (by Lemma 2). 

Substituting (A.23) into the left-hand side of (A. 22), we deduce that the left- 
hand side of (A. 22) is less than CAn ^ 2 • Since w > 1/2, (A. 22) is proved. 

□ 

A. 2. Proof of Theorem 2. We start with the proof of the following equa- 
tion which is implied by Lemmas 3 and 4: 

(A.24) A^+( 1 /^ ro ) A0 C/(A n ) -^ p 2aCp under Hi, 

where Cp is some constant. Then Theorem 2 is a straight consequence 
of (A.24), since now V n ^ p 2 1+1 / /3 " ro > 2 3 / 2 ~ ro and C ^ p 2 3 / 2 ~ ro . 

Now X vanishes to a deterministic drift satisfying dXt = b(Xt) dt. For 
simplicity, we assume that e~ = e + =: e. Then Y admits the following de- 
composition: 

Y t = X t + / / x(p, — u)(dx, ds) + / / x/j,(dx,ds) 

JO J\x\<e JO J\x\>e 

/ xF"{dx)ds 

Je<\x\<l 

:= X t + J\ } t + J2,t + Js,t- 

The next lemma reveals that the count of small increments has almost 
nothing to do with the large jumps. 

Lemma 3. Under the conditions in Theorem 2, 

< aA%) - P u _A\^(y -J2)\< «A-)| < CA n . 

Proof. Let M t = ^2 0<s<t I(\A s Y\ > £ )- T nen M is a Poisson counting 
process with uj wise time dependent intensity function J\ x \ >E Ps(dx) and 

(A.25) Pt^iAfM >l)<l-exp(- /* / F^' (dx) ds) < CA n . 

V Jti-1 J\x\>E J 

Notice that on A"M = 0, AfY = A"(Y - J 2 ), so the difference within the 
absolute value sign is 

E tj _ 1 [/(|ArF|<aA-)-J(|Ar(F-J 2 )|<aA-);A?M = Oor > 1] 

(A.26) 

= B k _ 1 [I(\AfY\ < aA-) - I(\A?(Y - J 2 )| < <); A?M > 1]. 
Lemma 3 is a consequence of (A.25) and (A.26). □ 
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Lemma 4. Under Assumption 4, 

P u _, (|A?(X + J x + J 3 )| < aA£) = ^A^-V^ao + 0p(Ar i/Py 

Proof. Let ^ = -aA^~ 1//3 - A r 7 1//3 (A"A + A"J 3 ) and u; = aA"" 1//3 - 
An 1 ^ (AfX + A™J 3 ). The required probability is equal to 

(A.27) P^JiKA-^Af^Kui). 

Now we prove the lemma in two cases: (i) /3 > 1 and (ii) (3 < 1. 
Case (i): /? > 1. By the Levy-Khintchine formula, 

75^ exp^A^A? Jx) = exp^^A" 1 ^)), 

where tp(u) = f R {ex.p(iuy) — 1 — iuyl(\y\ < l)}F'(dy). By a change of vari- 
able, we have 



(A.28) 



Hence, 



ij{A- l ^9) = [ (exp(^z) - l-i9zl(\z\ < 1))F' (A^ 13 z)A 1 J 13 , 
Jr 

- [ i0zl(l<\z\< A- 1 IP)F'{A 1 JP Z )A)IP dz. 

JR 



A n F'(A^z)A^ -> 1 ( fl (+) 7(2 > 0) + a^I(z < 0)) := v{z). 

By the dominant convergence theorem, Eexp^OAn 1 ^ AfJi) converges to 

(A.29) / (exp(i0z) - 1 - i9zl(\z\ < l))9(z) dz + i0/(p - l)(a (+) + a ( " } ). 
Jr 

Therefore we have A~ 1//3 A™ J x converges in distribution to a stable random 

variable. Since (AfX + Af J 3 )A^ 1//3 = o(l), by (A.27) and Assumption 4, 
the lemma is proved in this case. 

Case (ii): (3 < 1. In this case, we can further decompose Ji as follows: 

J\= — I I xu(dx,ds)+ / / xfi(dx,ds) := Jn + J\2- 
JO J\x\<e Jo J\x\<e 

Then the required probability in (A.27) could be rewritten as 

(A.30) P^k - A'^AfJu < A-^AfJ 12 < Ul - A-^AfJ 12 ). 

By similar calculation to (A.29), one gets A^'^AfJ^ converges to a stable 
random variable. First, consider the case where w > 1/(3. Now by (A.30) and 
Assumption 4, the lemma is obtained straightforwardly. Second, if w < 1//3, 
by (A.30), the required probability is asymptotically a constant. □ 
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By Lemmas 3 and 4, 

A l+(l//3- ro )A0^£ ti _ i/( | A?y | < _^P C ^ 

i=l 

which implies that the conditional variance goes to zero in probability, since 

n 

^A%i+0-/f>-*>)*°) Et ._ 1 l*(\A»Y\<aA%) ^ p 0. 

Therefore, a direct use of Lenglart's inequality yields (A. 24). 
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